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Description 

Method for generating a bit stream from an indexing tree 

The present invention relates to a method for generating a bit 
stream from an indexing tree as well as to a coding method and 
a decoding method which make use of the method for generating 
the bit stream. The invention further relates to a coding and 
decoding device. 

It is often necessary to read out certain contents from a bit 
stream according to a query previously formulated by a user or 
to determine with regard to certain contents whether said 
contents are in fact contained in the bit stream at all. In 
this case a query defined by a user can be formulated using a 
query language such as SQL (see reference [1] ) or XPATH (see 
reference [2] ) . It is advantageous here if the entire bit 
stream does not have to be searched for the desired contents 
but instead if the information is stored in an indexing list or 
in an indexing tree, so that only that part of the bit stream 
in which the indexing tree or the indexing list is stored has 
to be searched. 

The problem of the reading out of data from a bit stream arises 
for example in the case of a document produced with the aid of 
the XML language (XML = Extensible Markup Language) and 
represented in the MPEG7 BiM format. With regard to the MPEG7 
BiM format of an XML document, reference is made in particular 
here to the document [3] . With a representation such as this 
the generated bit stream is subdivided into a plurality of 
units (access units) which consist in their turn of a plurality 
of fragments (fragment update units) . The units are coded and 
where necessary sent in the form of an MPEG7 BiM stream to one 
or more recipients. 
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With regard to the querying of information from XML documents, 
a multiplicity of query languages are already known which 
permit searches to be made in the document for specific 
information. Reference may be made at this juncture by way of 
example to the already mentioned query language XPATH (see 
reference [2]) . The query language XPATH can be used to define 
selection criteria for filtering desired information within an 
XML document. In this case the purpose of a query can be on the 
one hand to assess whether a unit of the bit stream is 
important for the recipient. On the other hand a query can be 
used in a targeted manner to access specific desired 
information in the XML document. During the generation of the 
bit stream of an XML document the MPEG7 coding method has to 
date provided no mechanisms which enable random access to 
specific elements of the XML document. The MPEG7 bit stream 
must therefore be decoded for the purpose of searching for 
elements. A document in XML format is then obtained once again, 
which document can be searched by means of the query language 
XPATH. The decoding and subsequent processing of an XML 
document in order to search for specific contents is therefore 
very time-consuming, and so unacceptable for certain time- 
critical applications. Furthermore the problem can arise that 
the memory in the decoder is limited, with the result that the 
bit stream cannot be fully decoded. In addition, the overhead 
involved in decoding was unnecessary if the XPATH query 
executed on the decoded XML document ends with a negative 
result . 

Within the framework of TV-Anytime (TVA) , which is described in 
document [4] , an index structure is used which permits random 
access to certain elements of a data fragment . The index 
structure consists of a plurality of parts and comprises what 
is referred to as a ""key index list" in which all indexed paths 
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of a document are stored- When a query is submitted, said paths 
are compared in turn with the query until a matching entry is 
found in the key index list. Based on the information that is 
stored in the key index list in relation to this entry, the 
positions in a description stream at which the indexed entry is 
present in coded form can be determined. As a result of using 
the key index list it is no longer necessary to decode 
irrelevant data fragments, so that less memory space is 
required during a query. However, the linear processing of the 
key index list is time-consuming and the transfer of all the 
indexed paths is laborious and resource - intensive . 

The object of the invention is therefore to create a method for 
generating a bit stream from an indexing tree wherein the 
search for indexed data is made easily and efficiently 
possible . 

This object is achieved according to the features of the 
independent claims. Developments of the invention may be 
derived from the dependent claims . 

The indexing tree from which a bit stream is generated by means 
of the method according to the invention comprises a 
multiplicity of hierarchy levels, each hierarchy level being 
assigned one or more index nodes and the index nodes containing 
index data which is sorted in the indexing tree according to 
one or more predetermined criteria. With the method according 
to the invention, the index data of the index nodes is inserted 
into the bit stream and for each index node there is also 
inserted into the bit stream the information indicating at 
which position in the bit stream the index data of one or more 
index nodes of the hierarchy level which lies below the 
hierarchy level of the respective node is located. As a result 
of storing the additional information relating to the index 



2002P18889WOUS / PCTEP0312098 

4 

nodes in a lower hierarchy level a search for specific index 
data is considerably simplified since by this means branches to 
the index nodes relevant to the search are made possible. 
Considerably more efficient queries and searches for index data 
are therefore guaranteed - 

In a preferred embodiment the indexing tree is structured as 
what is referred to as a ''B tree" (= Balanced Tree) which 
ensures a balanced distribution of the data over the nodes of 
the tree. A detailed description of the B tree can be found in 
reference [5] . 

The index data in the indexing tree can be sorted according to 
any criteria, lexicographically for example. 

In a particularly preferred embodiment the index data is 
inserted into the bit stream according to the depth- first 
ordering principle. Through the use of depth-first ordering the 
index data in the indexing tree is inserted into the bit stream 
initially according to depth, as a result of which the items of 
information relevant to a query are arranged adjacent to one 
another in the bit stream and information that is not relevant 
can be efficiently skipped. A detailed description of depth- 
first ordering can be found in reference [6] . 

In a further preferred embodiment of the invention the index 
data comprises paths of a document structure tree which has a 
root node and a plurality of leaf nodes. The index data 
preferably also comprises the value instances of the paths and 
the positions of the value instances in the document which is 
represented by the document structure tree. In addition, in a 
preferred embodiment the index data also includes the number of 
paths in an index node . 
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In this case the paths can be absolute paths which start from 
the root node of the document structure tree and lead to a leaf 
node of the document structure tree. In a particularly 
preferred embodiment of the invention, however, the paths are 
relative paths, a relative path of a respective index node 
being a path relative to a path, previously inserted into the 
bit stream, of the respective index node or of an index node of 
a hierarchy level above the hierarchy level of the respective 
index node. The use of relative paths enables commonalities in 
the paths to be used, for the paths of adjacent nodes usually 
have a common component. In this way the amount of memory space 
required for storing the index data in the bit stream can be 
reduced. A further reduction of the memory space requirement 
can be achieved by inserting the paths of the index node whose 
index data is the first of a hierarchy level to be inserted 
into the bit stream in reverse sequence to the sequence in 
which the index data is arranged in the index node. This takes 
into account the fact that the index data at the end of the 
first index node of a hierarchy level has a greater similarity 
to the index node of the next -higher hierarchy level than the 
index data at the beginning of the first index node. 
Consequently a particularly effective coding results in certain 
cases through the use of relative paths. 

In a particularly preferred embodiment of the invention the 
paths comprise description elements of an XML document (XML = 
Extensible Markup Language) , the paths being in particular 
XPATH paths of the XML document. 

In a further embodiment of the invention the index data is 
coded in binary format by means of a coding method, in 
particular by means of an MPEG coding method. In a particularly 
preferred embodiment the MPEG7 coding method is used as the 
coding method. 
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In addition to the above -described method for generating a bit 
stream the invention further comprises a method for coding a 
data structure wherein the data elements of the data structure 
are indexed in an indexing tree, with a bit stream being 
generated according to the inventive method and said bit stream 
being part of the coded data stream. The invention also 
comprises a method for decoding a data structure, the method 
being embodied in such a way that a data structure coded by 
means of the coding method just described is decoded. 

The invention further comprises a method for coding and 
decoding a data structure which comprises the above-described 
coding and decoding method. 

The invention also comprises a coding device by means of which 
the coding method according to the invention can be performed, 
as well as a decoding device by means of which the decoding 
method according to the invention can be performed. The 
invention further relates to a corresponding device for coding 
and decoding by means of which the above-described combined 
coding and decoding method can be performed. 

Exemplary embodiments of the invention will be described and 
explained below with reference to the drawings, in which: 

Fig. 1 shows an example of an indexing tree used in the 
method according to the inventions- 



Fig. 2 



shows a section from the indexing tree shown in Fig. 
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Fig, 3 shows a section from a bit stream which has been 

generated from the indexing tree depicted in Fig. 1 
by means of the method according to the inventions- 
Fig. 4 shows a simplified indexing tree in which the index 
data is ordered lexicographically and represents 
paths of a document structure tree; 

Fig. 5 shows the indexing tree from Fig. 4, with some of the 
paths of the second hierarchy level being represented 
as relative paths; 

Fig. 6 shows the indexing tree from Fig. 4, with all the 

paths of the second hierarchy level being represented 
as relative paths; and 

Fig. 7 shows a variation of the indexing tree according to 
Fig. 4. 

Figure 1 shows an example of an indexing tree used in the 
method according to the invention. Said tree is what is known 
as a B tree (where B stands for Balanced) which is often used 
for the indexing of data. The structure of a B tree is 
described for example in the already mentioned reference [5] . 
The B tree has a balanced structure and consequently permits 
searching with only a logarithmic overhead compared to the 
number of entries, whereas a linear overhead is required for 
searching by means of a list. 

The indexing tree contains a plurality of nodes 1 to 10, with 
one or more items of index data being stored in each node in 
the form of what are referred to as keys. In the present case 
the items of index data are XPATH paths of a description tree 
of an XML document. A description of XPATH paths can be found 
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in the already mentioned reference [2] . The XPATH paths lead 
from the root node of the description tree of the XML document 
to the individual leaf nodes of the description tree of the XML 
document. XML elements with simple content and all XML 
attributes are used as leaf nodes in the description tree used 
in this embodiment. Consequently the index data of the indexing 
tree shown in Figure 1 comprises root -leaf paths of an XML 
description tree. 

In the indexing tree shown in Fig. 1 the XPATH paths are 
represented as a series of upper- and lowercase letters and are 
ordered lexicographically, with each letter standing for an 
element name in the XML description tree. For example, the path 
MDMVUFACN in node 1 means 

MPEG7/Description/MultimediaContent/Video/UsageInf ormation/Fina 
ncialResults/Accountltem/Costtype/Name. Also stored in the 
nodes in addition to the path are the value instances in which 
this path occurs in the XML document. In the present case the 
path MDMVUFACN occurs twice in index node 1 with the instances 
"Total for Production" and "Broadcast". In contrast, the path 
MDMVUAFAc occurs only once in index node 2 with the instance 
"EUR" . 

The numbering of the index nodes in the indexing tree of Figure 
1 corresponds to the order in which the index data is inserted 
into the bit stream with the aid of the method according to the 
invention. In this case the order corresponds to what is known 
as a depth- first ordering scheme, as described for example in 
the already mentioned reference [6] . The insertion of the index 
data into the bit stream according to the depth- first ordering 
principle results in a lesser complexity during access to the 
index data, since information that is not relevant is skipped 
during the search for index data. 
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Figure 2 shows a section from the indexing tree from Figure 1 
with the aid of which the querying of index data in this tree 
will be explained using an example. In this case the query is 
whether the XPATH path 

MPEG7/Description/Multimediacontent/Video/UsageInf ormation/Avai 
lability/Dissemination/Disseminator /Agent /Name contains the 
value instance "Discovery" (in short notation MDMVUADDAN = 
"Discovery") . Since the search string MDMVUADDAN is 
lexicographically less than the XPATH path "MDMVUFACN" of index 
node 1, the search refers back to index node 2. The search 
string is in turn lexicographically less than the entry 
"MDMVUAFAc" of index node 2. For this reason a transition is 
made to index node 3 located to the left of index node 2 . Said 
index node contains two XPATH paths "MDMVUAAt" and "MDMVUADFh" . 
The search string lies between these two XPATH paths, so in the 
next hierarchy level a transition is made to index node 5 which 
lies between index node 4 and index node 6, This index node now 
contains as its first element the searched- for XPATH path 
"MDMVUADDAN" as well as the searched- for value instance 
"Discovery". Thus, the matching value instance was found in 
index node 5 and the position of the corresponding element in 
the indexed XML document is then obtained. As a result of the 
fact that the index nodes are inserted into the bit stream 
according to the depth-first ordering principle, all further 
nodes of the indexing tree are ignored or skipped during the 
search, so the search for elements in the indexing tree has 
little complexity. 

The insertion of index data into the bit stream by means of the 
method according to the invention will be explored in detail 
below. This will be done with reference to Figure 3, which 
shows a section from a bit stream generated using the method 
according to the invention. In the bit stream shown in Figure 3 
the numbers of the index nodes are listed for the sake of 
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clarity, although said numbers are not transmitted with the 
stream. The contents of index nodes 1 and 3 are shown in detail 
in Figure 3. As the first element the nodes initially contain 
the number of entries, that is to say the number of XPATH paths 
stored in the node. This results here in the number 1 for index 
node 1 and the number 2 for index node 3. Next there follows 
the key for each entry, that is to say the respective XPATH 
path as well as the number of instantiations, i.e. the number 
of value instances that said XPATH path has in the XML 
document. For index node 1, for example, this results in the 
instantiation number 2 for the XPATH path "MDMVUAFCN" , since 
the path having the two value instances "Total for Production" 
and "Broadcast" occurs in the XML document. Furthermore, for 
each value instance, the position of said value instance in the 
XML document is inserted into the bit stream. It is essential 
with the method according to the invention that there is also 
inserted into the bit stream, in addition to this information, 
the information concerning at which position in the bit stream 
- in particular at which position relative to the current 
position (offset) - the index data of index nodes of the next- 
lower hierarchy level that are connected to the respective 
index node is located. This enables nodes that are not relevant 
for a predetermined search query to be skipped. 

The nodes branching from one node into a lower hierarchy level 
are referred to below as child nodes. It should be noted with 
the method according to the invention that the information 
relating to the position of the first child node of a hierarchy 
level does not need to be stored since said child node is read 
in next in the bit stream. Thus, as can be seen from Figure 3, 
only the position of index node 10 (referenced as "offset 
(10) ") is stored for index node 1 and only the positions of the 
index nodes 5 and 6 (referenced as "offset (5) " and "offset 
(6)") are stored for index node 3. If it transpires, for 
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example, that the searched- for information for a query is 
lexicographically greater than the XPATH in node 1, a branch is 
immediately made to node 10, so that it is no longer necessary 
to search index node 2 and the index nodes branching off from 
index node 2 . By this means an effective search is made 
possible . 

The following table lists once again which information is 
stored per node in the bit stream. 

Table : 



Node { 

number of entries 

for each entry: 
key 

number of instantiations 
for each instantiation: 

value instance 

position 

for each child node except the first 
offset in the stream 



The bit stream generated by means of the method according to 
the invention is transmitted to the recipient and can be read 
out in accordance with a query which was explained by way of 
example in relation to Figure 2. By means of comparisons 
between the XPATH to be queried and the entries in the index 
nodes specific node information is read out from the stream 
until the corresponding searched- for XPATH containing the 
searched- for value instance has been found or until no matching 
XPATH path is possible any longer on account of the sorting- In 
the latter case the desired information is not contained in the 
indexed document . 
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Further embodiments of the method according to the inventions 
are explained below with reference to Figures 4 to 7, which 
illustrate simplified indexing trees with lexicographically 
ordered entries. Figure 4 shows an indexing tree which is to be 
coded by means of the method according to the invention and 
which comprises nodes Kl to K4 . In this case node Kl is 
referred to as a parent node and the nodes K2 to K4 branching 
off from node Kl are referred to as child nodes. The entries 
are paths of a document structure tree. Instead of transmitting 
the paths with their entire length in the bit stream it is 
possible to store what are known as relative paths in the index 
node to be transmitted, said paths being relative in relation 
to a previously transmitted path of the parent node or in 
relation to a previously transmitted path of the index node 
currently to be transmitted. 

The insertion of relative paths into the bit stream is 
illustrated in Figure 5. In this case the notation 
signifies a step back in the path to which the relative path 
refers. In node Kl, the relative path "..E" in relation to the 
preceding entry ACB is transmitted for the entry "AE" . In the 
child nodes Kl, K2 and K3 , the relative paths in relation to 
the first path of the respective child node are transmitted in 
each case in the second and third entries. 

With the embodiment according to Figure 5, the full absolute 
path is transmitted for the first paths AA, ACC and AEF of the 
child nodes K2 , K3 and K4 respectively. It is now additionally 
possible also to transmit the first paths of the child nodes as 
relative paths in relation to a path of the parent node Kl . 
This is shown in Figure 6. In this case, instead of the paths 
AA and ACC of the first and second child nodes, the relative 
paths "..A" and "-C" in relation to the first path of the 
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parent node are transmitted. Instead of the path "AEF" of the 
third child node, the relative path "F" in relation to the 
second path of the parent node is transmitted. In this case the 
paths are preferably stored in the bit stream in their byte 
representation . 

As a result of the use of relative paths the data volume in the 
stream is substantially reduced because commonalities in the 
paths are made use of when the paths are stored, with the 
result that less memory space is required. In particular, with 
relative paths the respective common components of the paths 
only have to be transmitted once. 

In a further embodiment of the invention a data saving is 
achieved in that in the arrangement of the entries of the index 
nodes for the first child node K2 a reverse sequence is used 
compared to Figure 4. The result of this is that the entry 
which has the greatest similarity to the preceding parent node 
Kl is located at the start of the child node K2 . Thus, with 
special embodiments of the indexing tree this can lead to 
substantially shortened relative paths and a reduced memory 
requirement . 

The paths can be the already mentioned XPATH paths of an XML 
description tree. Furthermore the paths can be binary coded, by 
means of an MPEG7 coding method for example. In particular, the 
binary coding methods specified in the already mentioned 
reference [3] can be employed. Compared with textual 
representations, the use of binary coded paths can lead to 
savings in the number of bits to be transmitted. 
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